CDS
Accession Number | TCMCG075C01529 |
gbkey | CDS |
Protein Id | XP_017985209.1 |
Location | complement(join(7977618..7977728,7979501..7979837,7980331..7980542,7980695..7980805,7982381..7982488,7983979..7984088,7984364..7984473,7985305..7985513,7986275..7986439,7989261..7989379,7989468..7989555,7989682..7989787,7990402..7990490,7992384..7992527,7994250..7994342,7994502..7994568,7994717..7994829,7994942..7995037,7995387..7995542)) |
Gene | LOC18611948 |
GeneID | 18611948 |
Organism | Theobroma cacao |
Protein
Length | 847aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018129720.1 |
Definition | PREDICTED: beta-galactosidase 10 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | beta-galactosidase |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | - |
KEGG_ko | - |
EC | - |
KEGG_Pathway | - |
GOs |
GO:0003674
[VIEW IN EMBL-EBI] GO:0003824 [VIEW IN EMBL-EBI] GO:0004553 [VIEW IN EMBL-EBI] GO:0004565 [VIEW IN EMBL-EBI] GO:0005575 [VIEW IN EMBL-EBI] GO:0005618 [VIEW IN EMBL-EBI] GO:0005622 [VIEW IN EMBL-EBI] GO:0005623 [VIEW IN EMBL-EBI] GO:0005737 [VIEW IN EMBL-EBI] GO:0005773 [VIEW IN EMBL-EBI] GO:0008150 [VIEW IN EMBL-EBI] GO:0009505 [VIEW IN EMBL-EBI] GO:0009628 [VIEW IN EMBL-EBI] GO:0015925 [VIEW IN EMBL-EBI] GO:0016787 [VIEW IN EMBL-EBI] GO:0016798 [VIEW IN EMBL-EBI] GO:0030312 [VIEW IN EMBL-EBI] GO:0043226 [VIEW IN EMBL-EBI] GO:0043227 [VIEW IN EMBL-EBI] GO:0043229 [VIEW IN EMBL-EBI] GO:0043231 [VIEW IN EMBL-EBI] GO:0044424 [VIEW IN EMBL-EBI] GO:0044444 [VIEW IN EMBL-EBI] GO:0044464 [VIEW IN EMBL-EBI] GO:0050896 [VIEW IN EMBL-EBI] GO:0071944 [VIEW IN EMBL-EBI] GO:0080167 [VIEW IN EMBL-EBI] |
Sequence
CDS: ATGAAGCTCTTCCTTCCTCTCTTGTTTTGTTTCTTTACTTTGTTCAACTCTTGTTCTGCTGCCAATGTAACTTATGATCGCCGCTCTCTCATCATTGATGGCCAACGCAAGCTCCTCATTTCTGCTGCCATTCATTACCCCCGCAGCGTTCCTGGGATGTGGCCAGGGCTGGTTCAAACAGCTAAGGAAGGAGGGGTTGATGTCATTGAATCTTATGTGTTTTGGAATGGGCATGAGCTTTCTCCAGGAAAATACAATTTTGAAGGACGATATGATCTGGTCAAGTTTGTAAAGATTGTTCAGCAAGCTGGGATGTATATGATTCTTCGAATTGGCCCATTTGTAGCAGCTGAATGGAATTTTGGGGGGGTACCTGTCTGGTTGCACTATGTCCCTGGATCTGTGTTTCGATCTGATAATGAGCCCTTCAAGTATTACATGCAGAAGTTCATGACATTCATAGTGAACCTTATGAAGCAAGAGAAGCTTTTTGCATCACAAGGAGGTCCAATCATCATGGCCCAGGTGGAAAACGAATATGGATTTTATGAACAATATTATGGAGAAGGGGCAAAACGATATGTCACGTGGGCTGCTAAAATGGCAGTTTCTCAGAATATTGGAGTACCTTGGATAATGTGTCAGCAAGATGATGCTCCTGATCCTGTGATTAATACTTGTAACTCCTTTTACTGTGATCAATTCAAACCTAATTCTCCAAACAAGCCCAAAATTTGGACTGAGAACTGGCCTGGATGGTTTAAAACATTTGGGGCCAGAGATCCTCACAGGCCACCTGAGGATATTGCTTTTTCTGTTGCTCGTTTCTTTCAGAAAGGTGGAAGTGTACAAAATTACTACATGTATCATGGCGGAACGAATTTTGGTCGAACATCGGGTGGACCTTTCATTACAACAAGTTATGATTATGAAGCACCTATTGATGAGTATGGGTTACCTAGGCTTCCAAAATGGGGACACCTAAAGGAACTCCATAGAGCTATAAAGTTAAGTGAGCATGCATTGCTGAAGAGTGAACCAACTAATTTGTCACTAGGTCCTTCCCTAGAGGCTGATGTTTATGATGATGGTTCAGGAGCCTGTGCTGCCTTTCTTGCTAACATGGATGATAAAACTGACAAGAATGCAGTGTTCCGGAATGTGTCATATCACCTGCCTGCATGGTCAGTCAGCATTCTGCCTGACTGTAAGAATGTTGTATTTAACACGGCAAAGATAAGTTCCCAGGCCTCTGTGGTAGAAATGTTACCCGAGGAGTTGCAGCCATCAGTGGCATTACCCAGTAAAGACTTGAAAGCTCTAAAATGGGATATATTTGTGGAGAATGCTGGAATTTGGGGAGCAGCTGACTTCACTAAAAATGGTTTTCTGGATCATATAAATACCACAAAAGATACTACTGACTACCTCTGGTATACAACAAGTATAATTGTTGGTGAAAATGAAGAATTTCTGAAGAAGGGAAGCCATCCAGTTCTTCTTATTGAGTCAAAGGGTCATGCTCTTCATGCTTTTGTGAATCAGGAACTTCAAGGTAGTGCTTCTGGAAATGGCTCGCATTCGCCCTTCAAATTTGAGAATCCAATTTCTCTCAAGGCAGGGAAGAATGAAATTGCACTGTTAAGCATGACTGTGGGCCTACAAAATGCAGGTGGGTTATATGAATGGGTAGGAGCAGGACTAACAAGTGTGAAGATTGAGGGGCTCAACAATGGAACCATAGATTTGTCTATGTCTAGCTGGACCTACAAGATTGGATTGCAAGGAGAACACTTGGGTCTATACAAGCCAGAAATTTTGGCTGGTGTAAATTGGGTGTCAACCTCAGAACCACCAAAAAATCAGCCTCTGACGTGGTACAAGGTTGTTGTGGATCCACCATCAGGAGATGAACCAGTTGGACTGGACATGATTCATATGGGGAAAGGTCTAGCCTGGTTAAATGGAGAAGAGATCGGAAGATACTGGCCAATTAAAAGTTCTAAACATCTTGAGTGTGTACAGGAATGTGATTACAGAGGCAAATTTTTCCCAGACAAATGCCTTACTGGTTGTGGAGAACCAACACAAAGATGGTATCATGTTCCACGTTCTTGGTTCAAGCCATCTGGAAACATTTTGGTGATCTTCGAGGAAAAGGGTGGAGATCCAACAACAATTAGATTCTCAAAACGCAAAACATCAGGCCTATGTTCTCACATTTCTGAGGACTACCCTATGGTTGACCAGGAGTCAATATCTAAAGATGGAAATGGAAATGACAAAACCAGACCAACTGTCCATCTAAAGTGCCCCAAAAATACTTGGATATCTAATGTGAAATTCGCTAGCTATGGAAATCCAACAGGAAGGTGTGGGTTGTACAGCATGGGGGACTGCCATGATCCTAACTCAACATTTGTGGTTGAAAAGGTCTGCCTAGGTAAAAATGAGTGTGCCATAGAACTAACAGAAGAGAAGTTTGATAAGAGCTTGTGTCCTGGTACTACGAAGAAACTTGCAATTGAAGCAGTTTGCAGCTAA |
Protein: MKLFLPLLFCFFTLFNSCSAANVTYDRRSLIIDGQRKLLISAAIHYPRSVPGMWPGLVQTAKEGGVDVIESYVFWNGHELSPGKYNFEGRYDLVKFVKIVQQAGMYMILRIGPFVAAEWNFGGVPVWLHYVPGSVFRSDNEPFKYYMQKFMTFIVNLMKQEKLFASQGGPIIMAQVENEYGFYEQYYGEGAKRYVTWAAKMAVSQNIGVPWIMCQQDDAPDPVINTCNSFYCDQFKPNSPNKPKIWTENWPGWFKTFGARDPHRPPEDIAFSVARFFQKGGSVQNYYMYHGGTNFGRTSGGPFITTSYDYEAPIDEYGLPRLPKWGHLKELHRAIKLSEHALLKSEPTNLSLGPSLEADVYDDGSGACAAFLANMDDKTDKNAVFRNVSYHLPAWSVSILPDCKNVVFNTAKISSQASVVEMLPEELQPSVALPSKDLKALKWDIFVENAGIWGAADFTKNGFLDHINTTKDTTDYLWYTTSIIVGENEEFLKKGSHPVLLIESKGHALHAFVNQELQGSASGNGSHSPFKFENPISLKAGKNEIALLSMTVGLQNAGGLYEWVGAGLTSVKIEGLNNGTIDLSMSSWTYKIGLQGEHLGLYKPEILAGVNWVSTSEPPKNQPLTWYKVVVDPPSGDEPVGLDMIHMGKGLAWLNGEEIGRYWPIKSSKHLECVQECDYRGKFFPDKCLTGCGEPTQRWYHVPRSWFKPSGNILVIFEEKGGDPTTIRFSKRKTSGLCSHISEDYPMVDQESISKDGNGNDKTRPTVHLKCPKNTWISNVKFASYGNPTGRCGLYSMGDCHDPNSTFVVEKVCLGKNECAIELTEEKFDKSLCPGTTKKLAIEAVCS |